Live freelance tracking. Raw descriptions turned into structured data. Find your next tech project without the noise.
upwork.com π’ 2026-05-14
πΉ Import & Clean Public Permit Data into Structured Schema
π€ Client: πΊπΈ USA Member since 2025-01-14
π° Price: ****
π© Problem: Gathering and structuring raw permit data for analysis.
π¦ Existing: Not specified
Specifications:
[Target] Extract permit data from public sources (files or portals).
[Method] Automate data extraction using web scraping tools.
[UI/UX] Not applicable.
[Stack] Python, BeautifulSoup, Pandas, PostgreSQL/Supabase.
[Security] Ensure compliance with data privacy laws and secure storage.
[Format] Output as clean CSV or directly into PostgreSQL/Supabase.
Workflow:
1. Analyze sample raw permit files to understand structure and fields.
2. Develop web scraping scripts using BeautifulSoup to extract relevant data.
3. Clean and normalize extracted fields (addresses, dates, costs).
4. Map cleaned data into the target schema: permit_number, street_address, city, state, zip, contractor_name, work_category, record_type, permit_issue_date, estimate_cost, record_status, source_url, bed, bath, sqft, leftover_data, description.
5. Validate and test script with a sample dataset.
6. Document the process for future runs (monthly).
7. Expand to other datasets iteratively.